Amendments to the Claims: 

This listing of claims will replace all prior versions, and listings, of claims in the 
application: 

Listing of Claims: 

Claims 1-131 are cancelled and claims 132-201 were previously presented in an 
amendment. 

Claims 1-131 (cancelled) 

132. (amended) A method for generating synthesized acoustical data by 
concatenating audio segments of sounds to reproduce a sequence of concatenated 
sounds/phones wherein each sound/phone comprises three bands including an initial 
co-articulation band, a solo articulation band and a final co-articulation band, and 
each segment comprises one or more bands of a sound/phone, said method 
comprising: 

generating an inventory of audio segments comprising a plurality of 
audio segments comprising bands of one or more sounds/phones; 

establishing an earlier audio segment with at least a part of one 
band of a sound/phone selected for including an instance of concatenation; 

establishing a later audio segment with the rest of the portions and 
bands of the selected sound/phone wherein at least one of the earlier or 
later audio segments comprises bands of at least two adjacent sounds/phones, 
and wherein the solo articulation band of the selected sound/phone is at the 
trailing end of the earlier segment or at the leading end of the later segment 
and at least part of one of the co-articulation bands of the selected 
sound/phone is adjacent to the solo articulation band; and 

concatenating the two audio segments, whereby the concatenated audio 
segments comprise at least three bands of two adjacent sounds/phones. 
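
For illustration only, the band structure recited in claim 132 can be modelled as a small data structure. The following is a minimal sketch under stated assumptions, not the applicant's implementation: the names `Band`, `Segment`, `concatenate` and `phones`, and the phone labels "h" and "a", are hypothetical and do not appear in the application.

```python
from dataclasses import dataclass
from typing import List

# Band kinds, following the three-band model recited in claim 132.
INITIAL, SOLO, FINAL = "initial", "solo", "final"


@dataclass(frozen=True)
class Band:
    phone: str  # hypothetical phone label, e.g. "a"
    kind: str   # INITIAL, SOLO or FINAL


@dataclass
class Segment:
    bands: List[Band]


def concatenate(earlier: Segment, later: Segment) -> Segment:
    """Join two segments whose boundary falls inside one selected phone."""
    last, first = earlier.bands[-1], later.bands[0]
    # Assumption: the selected phone is the one shared by both segment ends.
    if last.phone != first.phone:
        raise ValueError("boundary must lie within a single selected phone")
    # The solo band sits at the trailing end of the earlier segment
    # or at the leading end of the later segment.
    if SOLO not in (last.kind, first.kind):
        raise ValueError("solo band must be at one side of the boundary")
    return Segment(earlier.bands + later.bands)


def phones(segment: Segment) -> List[str]:
    """List the distinct adjacent phones covered by a segment, in order."""
    out: List[str] = []
    for band in segment.bands:
        if not out or out[-1] != band.phone:
            out.append(band.phone)
    return out
```

In this sketch an earlier segment holding all three bands of "h" plus the initial co-articulation band of "a", joined to a later segment holding the solo and final bands of "a", yields six bands covering two adjacent phones.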

133. (previously presented) The method of claim 132 wherein the solo articulation 
band of the selected sound/phone is at the leading edge of the later audio segment and 
the final co-articulation band of the selected sound/phone is adjacent to the solo 
articulation band. 

134. (previously presented) The method of claim 132 wherein the solo articulation 
band of the selected sound/phone is at the trailing edge of the earlier audio segment 
and the final co-articulation band of the selected sound/phone is at the leading edge of 
the later audio segment. 

135. (previously presented) The method of claim 132 wherein at least a portion of 
one of the co-articulation bands of the selected sound/phone is disposed at an end of 
one of the segments and is opposite the solo articulation band at the end of the other 
segment. 

136. (previously presented) The method of claim 132 wherein the leading band of 
the later audio segment reproduces a static sound and the two audio segments are 
concatenated by overlapping the opposite, adjacent solo and co-articulation bands of 
the selected sound/phone with each other where the transfer function and the length of 
overlap are determined by acoustical data in the two segments. 

137. (previously presented) The method according to claim 136 wherein the static 
phones include vowels, diphthongs, liquids, vibrants, fricatives and nasals. 

138. (previously presented) The method of claim 132 wherein the band in the 
leading edge of the later audio segment reproduces a dynamic sound and the two 
audio segments are concatenated in a non-overlapping manner with each other, with the 
transfer function determined by acoustical data in the two segments. 

139. (previously presented) The method according to Claim 138 wherein the 
dynamic phones include plosives, affricates, glottal stops, and click sounds. 
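
As an illustration of the distinction drawn in claims 136 to 139, the sketch below overlaps the two segments when the leading band of the later segment is a static sound and joins them without overlap when it is a dynamic sound. It is a hedged example only: the linear fade, the caller-supplied `overlap` length, the class labels and the function names are assumptions, whereas the claims state that the transfer function and overlap length are determined by acoustical data in the two segments.

```python
from typing import List

# Phone classes named in claims 137 and 139; the string labels are illustrative.
STATIC = {"vowel", "diphthong", "liquid", "vibrant", "fricative", "nasal"}
DYNAMIC = {"plosive", "affricate", "glottal_stop", "click"}


def crossfade(earlier: List[float], later: List[float], overlap: int) -> List[float]:
    """Overlap the tail of `earlier` with the head of `later` using a linear fade."""
    overlap = min(overlap, len(earlier), len(later))
    if overlap <= 0:
        return earlier + later
    start = len(earlier) - overlap
    mixed = []
    for i in range(overlap):
        w = (i + 1) / (overlap + 1)  # weight given to the later segment
        mixed.append((1 - w) * earlier[start + i] + w * later[i])
    return earlier[:start] + mixed + later[overlap:]


def join(earlier: List[float], later: List[float],
         leading_class: str, overlap: int = 4) -> List[float]:
    """Choose the joining style from the class of the later segment's leading band."""
    if leading_class in STATIC:
        return crossfade(earlier, later, overlap)   # overlapping join
    if leading_class in DYNAMIC:
        return earlier + later                      # non-overlapping join
    raise ValueError(f"unknown phone class: {leading_class}")
```

With two eight-sample inputs and an overlap of four, the static case returns twelve samples and the dynamic case returns sixteen.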

140. (previously presented) The method according to Claim 132 wherein the initial 
co-articulation band of the selected sound/phone is disposed in the earlier audio 
segment and reproduces the properties of the start of the selected sound/phone 
sequence. 

141. (previously presented) The method according to Claim 132 wherein the final 
co-articulation band of the selected sound/phone is disposed in the later audio 
segment and reproduces the properties of the end of the selected sound/phone 
sequence. 

142. (previously presented) The method according to Claim 132 wherein voice data 
to be synthesized is combined in groups and each group comprises one or more 
individual audio segments. 

143. (previously presented) The method according to Claim 132 wherein an audio 
segment is established as the later audio segment which comprises the highest number 
of successive portions of the sounds/phones of the sound/phone sequence in order to 
use the smallest number of audio segment bands in the generation of the synthesized 
acoustical data. 

144. (previously presented) The method according to Claim 132 wherein the bands 
of the individual audio segments are processed in accordance with properties of the 
concatenated sound/phone sequence and wherein said properties include one or more 
of the group consisting of a modification of frequency, duration, amplitude, and 
spectrum. 

145. (previously presented) The method according to Claim 132 wherein the bands 
of individual audio segments are processed in accordance with properties of the 
selected band wherein the instance of concatenation lies, with these properties 
including one or more of the group of properties consisting of frequency, duration, 
amplitude, and spectrum. 

146. (previously presented) The method according to Claim 132 wherein the 
instance of concatenation is set in the bands of the selected sound/phone where at 
least two bands are in agreement with respect to one or more properties of the group 
of properties consisting of zero point, amplitude, gradients, derivatives of any degree, 
spectra, tone levels, amplitude values within a frequency band, volume, style of 
speech, and emotion of speech. 
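
By way of example only, setting the instance of concatenation where two bands agree can be sketched as a search for the pair of sample positions with the smallest mismatch. The sketch considers just two of the listed properties, amplitude and first-order gradient, and the cost function, the function names and the sample values are assumptions made for illustration.

```python
from typing import List, Tuple


def best_join(earlier_tail: List[float], later_head: List[float]) -> Tuple[int, int]:
    """Return (i, j) such that sample i of the earlier band best matches
    sample j of the later band in amplitude and local gradient."""
    if len(earlier_tail) < 2 or len(later_head) < 2:
        raise ValueError("each band needs at least two samples")
    best = None
    for i in range(1, len(earlier_tail)):
        slope_e = earlier_tail[i] - earlier_tail[i - 1]
        for j in range(len(later_head) - 1):
            slope_l = later_head[j + 1] - later_head[j]
            # Assumed cost: amplitude mismatch plus gradient mismatch.
            cost = abs(earlier_tail[i] - later_head[j]) + abs(slope_e - slope_l)
            if best is None or cost < best[0]:
                best = (cost, i, j)
    return best[1], best[2]


def splice(earlier_tail: List[float], later_head: List[float]) -> List[float]:
    """Cut both bands at the best-matching positions and join them."""
    i, j = best_join(earlier_tail, later_head)
    return earlier_tail[:i] + later_head[j:]
```

For the bands `[0, 1, 2, 3]` and `[5, 3, 4, 5]` the best match is sample 3 of the earlier band with sample 1 of the later band, and the spliced result is a smooth ramp from 0 to 5.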

147. (previously presented) The method according to Claim 132 wherein the 
acoustical data to be synthesized comprises voice data, and the sounds are phones. 

148. (previously presented) The method according to Claim 132 wherein the 
synthesized acoustical data is converted to acoustical signals and/or voice signals. 

149. (previously presented) The method of Claim 132 wherein the instance of 
concatenation is disposed within or at an end of one of the co-articulation bands. 

150. (amended) A device for generating synthesized acoustical data by 
concatenating audio segments of sounds to reproduce a sequence of concatenated 
sounds/phones from sounds/phones that include an initial co-articulation band, a solo 
articulation band and a final co-articulation band, comprising: 

segment providing means (107/108) for providing audio segments, said 
segments comprising bands of one or more sounds/phones; 

establishing means (105) for establishing at least two audio segments from the 
segment providing means, said establishing means selecting an earlier audio 
segment having at least a part of one band of a selected 
sound/phone and a later audio segment with the rest of the portions and 
bands of the selected sound/phone, wherein at least one of the earlier or later 
audio segments comprises bands of at least two adjacent sounds/phones, and 
said earlier audio segment having a solo articulation band of the selected 
sound/phone at the trailing end of the earlier segment or at the leading end of 
the later segment, at least part of one of the co-articulation bands of the 
selected sound/phone is adjacent to the solo articulation band and said selected 
sound/phone having an instance of concatenation; 

means for determining the duration and position of bands in the audio segments 
depending on the earlier and later audio segments; and 

means for concatenating (111) the two audio segments at an instance of 
concatenation within the selected sound/phone and as a function of properties of 
the bands at the trailing end of the earlier segment and at the leading end of the 
later segment, whereby the concatenated audio segments comprise at least three 
bands of two adjacent sounds/phones. 

151. (previously presented) The device of claim 150 wherein the means for 
providing audio segments comprises a database (107) in which audio 
segments are stored, each of which reproduces a portion of a phone or portions of a 
sequence of (concatenated) phones, or a synthesis means (108) for supplying audio 
segments, or any combination of said database and said synthesis means. 

152. (amended) The device of claim 150 wherein the solo articulation band of 
the selected sound/phone is at the leading edge of the later audio segment and the 
final co-articulation band of the selected sound/phone is adjacent to the solo 
articulation band. 

153. (previously presented) The device of claim 150 wherein the solo articulation 
band of the selected sound/phone is at the trailing edge of the earlier audio segment 
and the final co-articulation band of the selected sound/phone is at the leading edge of 
the later audio segment. 

154. (previously presented) The device of claim 150 wherein at least a portion of 
one of the co-articulation bands of the selected sound/phone is disposed at an end of 
one of the segments and is opposite the solo articulation band at the end of the other 
segment. 

155. (previously presented) The device of claim 150 wherein said concatenating 
means overlaps the leading band of the later audio segment having a static sound with 
the trailing band of the earlier audio segment and the transfer function and the length 
of overlap are determined by acoustical data in the two segments. 

156. (previously presented) The device of Claim 155 wherein the static phones 
include vowels, diphthongs, liquids, vibrants, fricatives and nasals. 

157. (previously presented) The device of Claim 150 wherein said concatenating 
means concatenates the audio segments in a non-overlapped manner when the band in 
the leading edge of the later audio segment reproduces a dynamic sound with the 
transfer function determined by acoustical data in the two segments. 

158. (previously presented) The device according to Claim 157 wherein the 
dynamic phones include plosives, affricates, glottal stops, and click sounds. 

159. (previously presented) The device according to Claim 150 wherein the 
selection means (105) selects audio segments which reproduce the greatest number of 
successive portions of concatenated phones of the concatenated phone sequence. 
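
The selection criterion of claim 159 can be pictured as a greedy longest-match search over an inventory. The sketch below is a simplification under stated assumptions: it works on whole phone labels rather than on bands, so it does not place the boundary inside a phone as the independent claims require, and the function name, inventory contents and phone labels are hypothetical.

```python
from typing import List, Sequence, Tuple


def select_segments(target: Sequence[str],
                    inventory: Sequence[Sequence[str]]) -> List[Tuple[str, ...]]:
    """Cover `target` left to right, always taking the longest inventory
    entry that matches at the current position."""
    chosen: List[Tuple[str, ...]] = []
    pos = 0
    while pos < len(target):
        best: Tuple[str, ...] = ()
        for entry in inventory:
            candidate = tuple(entry)
            n = len(candidate)
            # Skip empty entries; keep the longest entry matching at `pos`.
            if n > len(best) and tuple(target[pos:pos + n]) == candidate:
                best = candidate
        if not best:
            raise LookupError(f"no inventory entry matches at position {pos}")
        chosen.append(best)
        pos += len(best)
    return chosen
```

Preferring the longest match at each position tends to reduce the number of segments, and therefore the number of concatenation points, although a greedy search is not guaranteed to find the minimum in every case.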

160. (previously presented) The device according to Claim 150 wherein the 
concatenation means (111) comprises means for processing the bands of individual 
audio segments depending on properties of the concatenated phone sequence and with 
one or more functions selected from the group consisting of modification of 
frequency, duration, amplitude, and spectrum. 

161. (previously presented) The device according to Claim 150 wherein the 
concatenation means (111) comprises means for processing the bands of individual 
audio segments with one or more functions in a band selected from the group 
consisting of the instance of concatenation, modification of frequency, duration, 
amplitude, and spectrum. 

162. (previously presented) The device according to Claim 150 wherein the 
concatenation means (111) sets the instance of concatenation where at least two bands 
are in agreement with respect to one or more properties of the group of properties 
consisting of zero point, amplitude, gradients, derivatives of any degree, spectra, tone 
levels, amplitude values within a frequency band, volume, style of speech, and 
emotion of speech. 

163. (previously presented) The device according to Claim 150 
wherein the segment providing means includes audio segments with bands, each of 
which reproduces at least a portion of a sound or phone, respectively, a sound or 
phone, respectively, portions of phone sequences or polyphones, respectively, or 
sound sequences or polyphones, respectively. 

164. (previously presented) The device according to Claim 150 wherein the 
concatenation means (111) generates synthesized voice data by means of the 
concatenation of audio segments. 

165. (previously presented) The device according to Claim 150 further 
comprising means (117) for converting synthesized acoustical data to acoustical 
signals and/or voice signals. 



166. (amended) A data carrier which includes a computer program for the 
co-articulation specific concatenation of audio segments in order to generate synthesized 
acoustical data which reproduces a sequence of concatenated phones, wherein each 
sound/phone comprises three bands including an initial co-articulation band, a solo 
articulation band and a final co-articulation band, and each segment comprises one or 
more bands of a sound/phone, comprising the following steps: 

establishing an earlier audio segment with at least a part of one band of a 
sound/phone selected for including an instance of concatenation; 

establishing a later audio segment with the rest of the portions and bands of the 
selected sound/phone wherein at least one of the earlier or later audio segments 
comprises bands of at least two adjacent sounds/phones, and wherein the solo 
articulation band of the selected sound/phone is at the trailing end of the earlier 
segment or at the leading end of the later segment and at least part of one of the 
co-articulation bands of the selected sound/phone is adjacent to the solo articulation 
band; and 

concatenating the two audio segments, whereby the concatenated audio 
segments comprise at least three bands of two adjacent sounds/phones. 

167. (amended) The data carrier of Claim 166 wherein the solo articulation 
band of the selected sound/phone is at the leading edge of the later audio segment and 
the final co-articulation band of the selected sound/phone is adjacent to the solo 
articulation band. 

168. (previously presented) The data carrier of Claim 166 wherein the solo 
articulation band of the selected sound/phone is at the trailing edge of the earlier audio 
segment and the final co-articulation band of the selected sound/phone is at the 
leading edge of the later audio segment. 

169. (previously presented) The data carrier of Claim 166 wherein at least a portion 
of one of the co-articulation bands of the selected sound/phone is disposed at an end 
of one of the segments and is opposite the solo articulation band at the end of the 
other segment. 

170. (previously presented) The data carrier of Claim 166 wherein the leading band 
of the later audio segment reproduces a static sound and the two audio segments are 
concatenated by overlapping the opposite, adjacent solo and co-articulation bands of 
the selected sound/phone with each other where the transfer function and the length of 
overlap are determined by acoustical data in the two segments. 

171. (previously presented) The data carrier according to Claim 170 wherein the 
static phones include vowels, diphthongs, liquids, vibrants, fricatives and nasals. 

172. (previously presented) The data carrier of Claim 166 wherein the band in the 
leading edge of the later audio segment reproduces a dynamic sound and the two 
audio segments are concatenated in a non-overlapping manner with each other, with the 
transfer function determined by acoustical data in the two segments. 

173. (previously presented) The data carrier according to Claim 172 wherein the 
dynamic phones include plosives, affricates, glottal stops, and click sounds. 

174. (previously presented) The data carrier according to Claim 166 wherein the 
initial co-articulation band of the selected sound/phone is disposed in the earlier audio 
segment and reproduces the properties of the start of the selected sound/phone 
sequence. 

175. (previously presented) The data carrier according to Claim 166 wherein the 
final co-articulation band of the selected sound/phone is disposed in the later audio 
segment and reproduces the properties of the end of the selected sound/phone 
sequence. 

176. (previously presented) The data carrier according to Claim 166 wherein voice 
data to be synthesized is combined in groups and each group comprises one or more 
individual audio segments. 

177. (previously presented) The data carrier according to Claim 166 wherein an 
audio segment is established as the later audio segment which comprises the highest 
number of successive portions of the sounds/phones of the sound/phone sequence in 
order to use the smallest number of audio segment bands in the generation of the 
synthesized acoustical data. 

178. (previously presented) The data carrier according to Claim 166 wherein the 
bands of the individual audio segments are processed in accordance with properties of 
the concatenated sound/phone sequence and wherein said properties include one or 
more of the group consisting of a modification of frequency, duration, amplitude, and 
spectrum. 

179. (previously presented) The data carrier according to Claim 166 wherein the 
bands of individual audio segments are processed in accordance with properties of the 
selected band wherein the instance of concatenation lies, with these properties 
including one or more of the group of properties consisting of frequency, duration, 
amplitude, and spectrum. 

180. (previously presented) The data carrier according to Claim 166 wherein the 
instance of concatenation is set in the bands of the selected sound/phone where at 
least two bands are in agreement with respect to one or more properties of the group 
of properties consisting of zero point, amplitude, gradients, derivatives of any degree, 
spectra, tone levels, amplitude values within a frequency band, volume, style of 
speech, and emotion of speech. 

181. (previously presented) The data carrier according to Claim 166 wherein data to 
be synthesized comprises voice data, and the sounds are phones. 

182. (previously presented) The data carrier according to Claim 166 wherein the 
synthesized data is converted to acoustical signals and/or voice signals. 

183. (previously presented) The data carrier of Claim 166 wherein the instance of 
concatenation is disposed within or at an end of one of the co-articulation bands. 

184. (previously presented) The data carrier of Claim 166 wherein data is stored as 
acoustical data, optical data, magnetic data or electrical data. 

185. (previously presented) The data carrier of claim 184 wherein a group of the 
audio segments reproduces sounds or phones, respectively, or portions of sounds or 
phones, respectively. 

186. (previously presented) The data carrier of Claim 184 wherein a group of the 
audio segments reproduces phone sequences or portions of phone sequences or 
polyphones, respectively, or portions of polyphones. 

187. (amended) A synthesized voice signal comprising a sequence of sounds or 
phones with the voice signals comprising segments of sounds to reproduce a sequence 
of concatenated sounds/phones wherein each sound/phone comprises three bands 
including an initial co-articulation band, a solo articulation band and a final 
co-articulation band, and each segment comprises one or more bands of a sound, said 
synthesized voice signals comprising: 

at least two audio segments concatenated for providing the synthesized 
voice signal, said two audio segments including an earlier audio segment with 
at least a part of one band of a sound/phone selected for including an instance 
of concatenation and a later audio segment with the rest of the portions and 
bands of the selected sound/phone wherein at least one of the earlier or later 
audio segments comprises bands of at least two adjacent sounds/phones, and 

wherein the solo articulation band of the selected sound/phone is at the 
trailing end of the earlier segment or at the leading end of the later segment 
and at least part of one of the co-articulation bands of the selected 
sound/phone is adjacent to the solo articulation band and the two audio 
segments are concatenated to provide the synthesized voice signal, whereby 
the concatenated audio segments comprise at least three bands of two adjacent 
sounds/phones. 

188. (amended) The synthesized voice signal of Claim 187 wherein the solo 
articulation band of the selected sound/phone is at the leading edge of the later audio 
segment and the final co-articulation band of the selected sound/phone is adjacent to 
the solo articulation band. 

189. (previously presented) The synthesized voice signal of claim 187 wherein the 
solo articulation band of the selected sound/phone is at the trailing edge of the earlier 
audio segment and the final co-articulation band of the selected sound/phone is at the 
leading edge of the later audio segment. 

190. (previously presented) The synthesized voice signal of Claim 187 wherein at 
least a portion of one of the co-articulation bands of the selected sound/phone is 
disposed at an end of one of the segments and is opposite the solo articulation band at 
the end of the other segment. 

191. (previously presented) The synthesized voice signal of Claim 187 wherein the 
leading band of the later audio segment reproduces a static sound and the two audio 
segments are concatenated by overlapping the opposite, adjacent solo and 
co-articulation bands of the selected sound/phone with each other where the transfer 
function and the length of overlap are determined by acoustical data in the two 
segments. 

192. (previously presented) The synthesized voice signal of Claim 191 wherein the static 
phones include vowels, diphthongs, liquids, vibrants, fricatives and nasals. 

193. (previously presented) The synthesized voice signal of Claim 187 wherein the 
band in the leading edge of the later audio segment reproduces a dynamic sound and 
the two audio segments are concatenated in a non-overlapping manner with each other, with 
the transfer function determined by acoustical data in the two segments. 

194. (previously presented) The synthesized voice signal of claim 193 wherein the 
dynamic phones include plosives, affricates, glottal stops, and click sounds. 

195. (previously presented) The synthesized voice signal of Claim 187 wherein the 
initial co-articulation band of the selected sound/phone is disposed in the earlier audio 
segment and reproduces the properties of the start of the selected sound/phone 
sequence. 

196. (previously presented) The synthesized voice signal of Claim 187 wherein the 
final co-articulation band of the selected sound/phone is disposed in the later audio 
segment and reproduces the properties of the end of the selected sound/phone 
sequence. 

197. (previously presented) The synthesized voice signal of Claim 187 wherein 
voice data to be synthesized is combined in groups and each group comprises one or 
more individual audio segments. 

198. (previously presented) The synthesized voice signal of Claim 187 wherein an 
audio segment is established as the later audio segment which comprises the highest 
number of successive portions of the sounds/phones of the sound/phone sequence in 
order to use the smallest number of audio segment bands in the generation of the 
synthesized acoustical data. 

199. (previously presented) The synthesized voice signal of Claim 187 wherein the 
acoustical data to be synthesized comprises voice data, and the sounds are phones. 

200. (previously presented) The synthesized voice signal of Claim 187 wherein the 
synthesized acoustical data is converted to acoustical signals and/or voice signals. 

201. (previously presented) The synthesized voice signal of Claim 187 wherein the 
instance of concatenation is disposed within or at an end of one of the co-articulation 
bands. 